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Abstract 



. We study integrality gaps and approximability of two closely related problems on directed 

^| ' graphs. Given a set V of n nodes in an underlying asymmetric metric and two specified nodes 

s and i, both problems ask to find an s-t path visiting all other nodes. In the asymmetric 
' traveling salesman path problem (ATSPP), the objective is to minimize the total cost of this 

path. In the directed latency problem, the objective is to minimize the sum of distances on this 
path from s to each node. Both of these problems are NP-hard. The best known approximation 
algorithms for ATSPP had ratio O(logn) [TIE] until the very recent result that improves it to 
^3 ' O(logn/loglogn) [2111]. However, only a bound of 0(y/n) for the integrality gap of its linear 

s ! ■ programming relaxation has been known. For directed latency, the best previously known 

approximation algorithm has a guarantee of 0(n 1 / 2+e ), for any constant e > [25] . 

We present a new algorithm for the ATSPP problem that has an approximation ratio of 



C***~ ' O(logn), but whose analysis also bounds the integrality gap of the standard LP relaxation of 



ATSPP by the same factor. This solves an open problem posed in [7]. We then pursue a deeper 
study of this linear program and its variations, which leads to an algorithm for the fc-person 
ATSPP (where k s-t paths of minimum total length are sought) and an 0(log n)-approximation 
for the directed latency problem. 

X" 

1 Introduction 

Let G = (V, E) be a complete directed graph on a set of n nodes and let d : E — > M + be a cost 
function satisfying the directed triangle inequality d uw < d uv + d vw for all u,v,w G V. However, 
d is not necessarily symmetric: it may be that d uv ^ d vu for some nodes u, v € V. In the metric 
Asymmetric Traveling Salesman Path Problem (ATSPP), we are also given two distinct nodes 
s,t £ V. The goal is to find a path s = vx, V2, ■ ■ ■ , v n = t that visits all the nodes in V while 
minimizing the sum X^?=i d VjVj+1 - ATSPP can be used to model scenarios such as minimizing the 
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total cost of travel for a person trying to visit a set of cities on the way from a starting point to 
a destination. This is a variant of the classical Asymmetric Traveling Salesman Problem (ATSP), 
where the goal is to find a minimum-cost cycle visiting all nodes. In the fc-person ATSPP, given an 
integer k > 1, the goal is to find k paths from s to t such that every node is contained in at least 
one path and the sum of path lengths is minimized. 

Related to ATSPP is the directed latency problem. On the same input, the goal is to find a 
path s = v\, V2, ■ ■ ■ i v n = t that minimizes the sum of latencies of the nodes. Here, the latency of 
node Vi in the path is defined as X^'=i d Vj vj +1 ■ The objective can be thought of as minimizing 
the total waiting time of clients or the average response time. There are possible variations in the 
problem definition, such as asking for a cycle instead of a path, or specifying only s but not t, 
but they easily reduce to the version that we consider. Other names used in the literature for this 
problem are the deliveryman problem [23] and the traveling repairman problem pQ. 

1.1 Related work 

Both ATSPP and the directed latency problem are closely related to the classical Traveling Sales- 
man Problem (TSP), which asks to find the cheapest Hamiltonian cycle in a complete undirected 
graph with edge costs [151121] , In general weighted graphs, TSP is not approximable. However, 
in most practical settings it can be assumed that edge costs satisfy the triangle inequality (i.e. 
d U w < d uv +d vw ). Though metric TSP is still NP-hard, the well-known algorithm of Christofides [8] 
has an approximation ratio of 3 /2. Later the analysis in |27tl29] showed that this approximation 
algorithm actually bounds the integrality gap of a linear programming relaxation for TSP known 
as the Held-Karp LP. This integrality gap is also known to be at least 4 / 3 - Furthermore, for all 
e > 0, approximating TSP within a factor of 220 /2i9 — e is NP-hard [26]. Christofides' heuristic 
was adapted to the problem of finding the cheapest Hamiltonian path in a metric graph with an 
approximation guarantee of 3 /2 if at most one endpoint is specified or 5 /3 if both endpoints are 
given [16]. 

In contrast to TSP, no constant-factor approximation for its asymmetric version is known. The 
current best approximation for ATSP is the very recent result of Asadpour et al. [3], which gives 
an 0(logn/loglogn)-approximation algorithm. It also upper-bounds the integrality gap of the 
asymmetric Held-Karp LP relaxation by the same factor. Previous algorithms guarantee a solution 
of cost within O(logn) factor of optimum [9"l llll[T8|ll9j . The algorithm of Frieze et al. [11] is shown 
to upper-bound the Held-Karp integrality gap by log 2 n in [28], and a different proof that bounds 
the integrality gap of a slightly weaker LP is obtained in [23] . The best known lower bound on the 
Held-Karp integrality gap is essentially 2 [5], and tightening these bounds remains an important 
open problem. ATSP is NP-hard to approximate within 117 /ii6 — e [26]. 

The path version of the problem, ATSPP, has been studied much less than ATSP, but there 
are some recent results concerning its approximability. An 0(y/n) approximation algorithm for it 
was given by Lam and Newman [20], which was subsequently improved to O(logn) by Chekuri 
and Pal [7]. Feige and Singh [9] improved upon this guarantee by a constant factor and also 
showed that the approximability of ATSP and ATSPP are within a constant factor of each other, 
i.e. an a-approximation for one implies an 0(a)-approximation for the other. Combined with the 
result of [3], this implies an 0(logn/loglogn) approximation for ATSPP. However, none of these 
algorithms bound the integrality gap of the LP relaxation for ATSPP. This integrality gap was 
considered by Nagarajan and Ravi [25], who showed that it is at most 0{^/n). To the best of our 
knowledge, the asymmetric path version of the fc-person problem has not been studied previously. 
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However, some work has been done on its symmetric version, where the goal is to find k rooted 
cycles of minimum total cost (e.g., |12j). 

The metric minimum latency problem is NP-hard for both the undirected and directed versions 
since an exact algorithm for either of these could be used to efficiently solve the Hamiltonian Path 
problem. The first constant-factor approximation for minimum latency on undirected graphs was 
developed by Blum et al. [4]. This was subsequently improved in a series of papers from 144 to 
21.55 [13], then to 7.18 [2] and ultimately to 3.59 [6]. Blum et al. [3] also observed that there is 
some constant c such that there is no c-approximation for minimum latency unless P = NP. For 
directed graphs, Nagarajan and Ravi [25] gave an 0((p + log re) n e e~ 3 ) approximation algorithm 
that runs in time n°( l l e \ where p is the integrality gap of an LP relaxation for ATSPP. Using their 
0(y/n) upper bound on p, they obtained a guarantee of 0(re 1//2+e ), which is the best approximation 
ratio known for this problem before our present results. 

1.2 Our results 

In this paper we study both the ATSPP and the directed latency problem. The natural LP 



min d e x e (1) 

s.t. x(5 + (u)) =x(5-(u)) VreGF\{s,t} (2) 

x(6 + (s)) = x(6-(t)) = l (3) 

x(S-(s)) = x(5 + (t)) = (4) 

x{8~(S))>a VS CV,S ^®,s S (5) 

x e > Ve€£ 



relaxation for ATSPP is ([I]) with a = 1, where 5 + (-) denotes the set of outgoing edges from a 
vertex or a set of vertices, and S~(-) denotes the set of incoming edges. A variable x e indicates that 
edge e is included in a solution. Let us refer to this linear program as LP (a), as we study it for 
different values of a in constraints ([5]). We begin in Section [2] by proving that the integrality gap 
of LP(a = 1) is O(logre). 

Theorem 1.1 If L is the cost of a feasible solution to LP |7P with a = 1, then one can find, in 
polynomial time, a Hamiltonian path from s to t with cost at most (2 log 

We note that, despite bounding the integrality gap, our algorithm is actually combinatorial and 
does not require solving the LP. We strengthen the result of Theorem 11.11 by extending it to any a 
with ^ < q < 1. This captures the LP of [25], which has a = |, and is also used in our algorithm 
for the directed latency problem. We prove the following theorem in Section [3j 

Theorem 1.2 If L is the cost of a feasible solution to LP |7p with | < a < 1, then one can find, 
in polynomial time, a Hamiltonian path from s to t with cost at most 6 ig&gij . L. 

1 A\\ logarithms in this paper are base 2. 
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It is worth observing that this theorem, together with the results of [25], imply a polylogarithmic 
approximation algorithm for the directed latency problem which runs in quasi-polynomial time, 
as well as a polynomial-time 0(n ,E )-approximation. However, that approach relies on guessing a 
large number of intermediate vertices of the path, and thus does not yield an algorithm that has 
both a polynomial running time and a polylogarithmic approximation guarantee. So, to obtain a 
polynomial-time approximation, we use a different approach. For that we consider LP (a) for values 
of a that include a < \. If we allow a < \ then LP(a), as a relaxation of ATSPP, can be shown 
to have an unbounded integrality gap. However, we prove the following theorem in Section [U 

Theorem 1.3 If L is the cost of a feasible solution to LP (CP with a = \, for integer k, then one 
can find, in polynomial time, a collection of at most k ■ log n paths from s to t, such that each vertex 
of G appears on at least one path, and the total cost of all these paths is at most kLlogn. 

Next, we study another generalization of the ATSPP, namely the /c-person asymmetric traveling 
salesman path problem. In Section [5] we prove the following theorem: 

Theorem 1.4 There is an 0(k 2 logn) approximation algorithm for the k-person ATSPP. More- 
over, the integrality gap of its LP relaxation is bounded by the same factor. 

Given these results concerning LP (a), we study a particular LP relaxation for the directed 
latency problem in Section [H We improve upon the 0(n 1 / 2+e )-approximation of |25j substantially 
by proving the following: 

Theorem 1.5 A solution to the directed latency problem can be found in polynomial time that has 
cost no more than O(logn) • L, where L is the value of LP relaxation which is also a lower 
bound on the integer optimum. 

We note that this seems to be the first time that a bound is placed on the integrality gap of 
any LP relaxation for the minimum latency problem, even in the undirected case. 

2 Integrality gap of ATSPP 

We show that LP relaxation (TTJ) of ATSPP with a = 1 has integrality gap of O(logn). Let x* be 
its optimal fractional solution, and let L be its cost. We define a path-cycle cover on a subset of 
vertices W C V containing s and t to be the union of one s-t path and zero or more cycles, such 
that each v G W occurs in exactly one of these subgraphs. The cost of a path-cycle cover is the 
sum of costs of its edges. 

Our approach is an extension of the algorithm by Frieze et al. |11| . analyzed by Williamson |28j 
to bound the integrality gap for ATSP. That algorithm finds a minimum-cost cycle cover on the 
current set of vertices, chooses an arbitrary representative vertex for each cycle, deletes other 
vertices of the cycles, and repeats, at the end combining all the cycle covers into a Hamiltonian 
cycle. As this is repeated at most logn times, and the cost of each cycle cover is at most the cost 
of the LP solution, the upper bound of log n on the integrality gap is obtained. In our algorithm 
for ATSPP, the analogue of a cycle cover is a path-cycle cover (also used in [20]), whose cost is 
at most the cost of the LP solution (Lemma I2.2p . At the end we combine the edges of O(logn) 
path-cycle covers to produce a Hamiltonian path. However, the whole procedure is more involved 
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than in the case of ATSP cycle. For example, we don't choose arbitrary representative vertices, 
but use an amortized analysis to ensure that each vertex only serves as a representative a bounded 
number of times. 

We note that a path-cycle cover of minimum cost can be found by a combinatorial algorithm, 
using a reduction to minimum-cost perfect matching, as explained in |20j . In the proof of Lemma 
12.21 below, we make use of the following splitting-off theorem, as also done in [24], where splitting 
off edges yv and vx refers to replacing these edges with the edge yx (unless y = x, in which case 
the two edges are just deleted). 

Theorem 2.1 (Frank [10J and Jackson [1TJ ) Let G = (V,E) be a Eulerian directed graph and 
vx G E. There exists an edge yv G E such that splitting off yv and vx does not reduce the directed 
connectivity from u to w for any u,w G V \ {v}. 

This theorem also applies to weighted Eulerian graphs, i.e. ones in which the weighted out-degree 
of every vertex is equal to its weighted in-degree, since weighted edges can be replaced by multiple 
parallel edges, producing an unweighted Eulerian multigraph. 

Lemma 2.2 For any subset W C V that includes s and t, there is a path-cycle cover of W of cost 
at most L. 

Proof. Consider the graph G' obtained from G by assigning capacities x* to edges e G E and 
adding a dummy edge from t to s with unit capacity. From constraints (J2])-(j4|) of LP(a = 1) it 
follows that this is a weighted Eulerian graph. Constraints (JS|) and the max-flow min-cut theorem 
imply that for any v G V, the directed connectivity from s to v in G' is at least a = 1. 

We apply the splitting-off operation on G', as guaranteed by Theorem 12. 11 to vertices in V\ W 
until all of them are disconnected from the rest of the graph. Let G" be the resulting graph on 
W and let x' be its edge capacities except for the dummy edge ts (which was unaffected by the 
splitting-off process). By Theorem 12.11 the directed connectivity from s to any v G W does not 
decrease from the splitting-off operations, which means that in G" it is still at least a. This ensures 
that x' satisfies constraints ([5]) for all sets S C W with S 7^ and s ^ S, and is a feasible solution 
to LP (a) on the subset W of vertices. Furthermore, the triangle inequality implies that the cost of 
x' is no more than that of x*, namely L. 

Now we make the observation that if we remove from LP(a = 1) constraints ([5]) for all but 
singleton sets, the resulting LP is equivalent to a circulation problem, and thus has an integer 
optimal solution. Since there is a feasible solution to LP(a = 1) on the set W of cost at most L 
(namely x'), and removing a constraint can only decrease the optimal objective value, it means 
that there is an integer solution to the following program that costs no more than L: 

(6) 

x(<T(«))>l VueW\{s,t} 
x(S-(t)) = l (7) 
x(5 + (t)) = 

Ve G E 

In principle, this integer solution can have x(5 + (u)) > 1 for some nodes u. In this case, we find a 
Euler tour of each component of the resulting graph (with dummy edge ts added in) and shortcut 



min d e x e 

s.t. x(5 + (u)) = 
x(5 + (s)) = 
x(5'(s)) = 
x e >0 
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it over any repeated vertices. This ensures that x(6 + (u)) = 1 for all u without increasing the cost. 
But such a solution is precisely a path-cycle cover of W. □ 



Algorithm 1 Asymmetric Traveling Salesman Path 



Let a set W <— V; integer labels l v <— for all v £ V; flow F <— and circulation H <— 
for 2 log 2 n + 1 iterations do 

Find the minimum-cost s-t path-cycle cover F' on W 

F <— F + F' t> P is acyclic before this operation 

Find a path-cycle decomposition of F, with cycles C\...Ck and paths P\...Ph, such that 
(Jj -Pj is acyclic 

for each connected component A of (J • Cj do > A is a circulation 

For each vertex let d u be the in-degree of u in ^4 

Find a "representative" node w G A minimizing l v + cZ„ 

F <— F — A > subtract flows 

for each w £ w / v, and for each path Pj 

if it! E Pj then modify P by shortcutting Pj over w 
Remove all nodes in A, except v, from W > Note: they don't participate in P anymore 
H <— H + A > add circulations 

end for 
end for 

Let P be an s-t path consisting of nodes in W in the order found by topologically sorting P 

> P is an acyclic flow on the nodes W 
for every connected component X of H of size \X\ > 1 do 

Find a Euler tour of A", shortcut over nodes that appear more than once 

Incorporate the resulting cycle into P using a shared node 
end for 

return P > P is a Hamiltonian s-t path 



We consider Algorithm [TJ Roughly speaking, the idea is to find a path-cycle cover, select a rep- 
resentative node for each cycle, delete the other cycle nodes, and repeat. Actually, a representative 
is selected for a component more general than a simple cycle, namely a union of one or more cycles. 
We ensure that each vertex is selected as a representative at most logn times, which means that af- 
ter 2 log n + 1 iterations, each surviving vertex has participated in the acyclic part of the path-cycle 
covers at least logn + 1 times. This allows us at the end to find an s-t path which spans all the 
surviving vertices, W, and consists entirely of edges in the acyclic part, P, of the union of all the 
path-cycle covers, using a technique of |25j . Then we insert into it the subpaths obtained from the 
cyclic part, H, of the union of path-cycle covers, connected through their representative vertices. 
We occasionally treat subgraphs satisfying appropriate degree constraints as flows or circulations. 

Lemma 2.3 During the course of the algorithm, no label l v exceeds the value logn. 

The idea of the proof is, as the algorithm proceeds, to maintain a forest on the set of nodes V, 
such that the number of leaves in a subtree rooted at any node v G V is at least 2 lv . The lemma 
then follows because the total number of leaves is at most n. We first prove an auxiliary claim. 
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Claim 2.4 In each component A found by Algorithm^ on line® there are two distinct nodes x 
and y such that d x = d y = 1. 

Proof. Let F be the value of F at the start of the current iteration of the outside loop, i.e. 
before F' is added to it on line HI F is acyclic, because during the course of the loop, all cycles of 
F are subtracted from it. So A is a union of cycles, formed from the sum of an acyclic flow F and 
a path-cycle cover F' , which sends exactly one unit of flow through each vertex. 

Consider a topological ordering of nodes based on the flow F, and let x and y be the first and 
last nodes of A, respectively, in this ordering. As A always contains at least two nodes, x and y 
are distinct. Since x and y participate in some cycle(s) in A, their in-degrees are at least 1. We 
now claim that the in-degree of x in A is at most 1. Indeed, since all other nodes of A are later 
than x in the topological ordering, it cannot have any flow coming from them in F. So the only 
incoming flow to x can be in F' . But since F' sends a flow of exactly one unit through each vertex, 
the in-degree of x in A is at most one. A symmetrical argument can be made for y, showing that 
its out-degree in A is at most one. But since A is a union of cycles, every node's in-degree is equal 
to its out-degree, and the in-degree of y is also at most 1. □ 

Proof of Lemma 12.31 As the algorithm proceeds, let us construct a forest on the set of nodes V. 
Initially, each node is the root of its own tree. We maintain the invariant that W is the set of tree 
roots in this forest. For each component A that the algorithm considers, and the node v found on 
line El we attach the nodes of A, except v, as children of v. Note that the invariant is maintained, 
as these nodes are removed from W on line [12j The set of nodes of each component A found on 
line [6] is always a subset of W, and thus our construction indeed produces a forest. 

We show by induction on the steps of the algorithm that if a node has label I, then its subtree 
contains at least 2 l leaves. Thus, since there are n nodes total, no label can exceed log 2 n. At the 
beginning of the algorithm, all labels are 0, and all trees have one leaf each, so the base case holds. 
Now consider some iteration in which the label of vertex v S A is increased from l v to l v + d v . By 
Claim [231 there are nodes x,y £ A (possibly one of them equal to v) with d x = d y = 1. Since v 
minimizes l u + d u among all vertices u 6 A, we have that l x + d x > l v + d v and l y + d y > l v + d v , 
and thus l x > l v + d v — 1 and l y > l v + d v — 1. Thus, by the induction hypothesis, the trees rooted 
at x and y each have at least 2 lv+dv ~ 1 leaves. Because we update the forest in such a way that v's 
new tree contains all the leaves of trees previously rooted at x and y, this tree now has at least 
2 . = 2 lv+dv leaves. □ 

Lemma 2.5 At the end of the algorithm's main loop, the flow in F passing through any node 
v £ W is equal to 21ogn + 1 — l v , and thus (by Lemma [273\) is at least logn + 1. 

Proof. There are 2 logn + 1 iterations, each of which adds one unit of flow through each vertex 
v £ W. We now claim that for a vertex v G W, the amount of flow removed from it is equal to its 
label, l v . Flow is removed from v only if v becomes part of some component A. Now, if it is ever 
part of A, but not chosen as a representative on line (H then it is removed from W. Thus, we are 
only concerned about vertices that are chosen as representatives every time that they are part of 
A. Such a vertex has flow d v going through it in A, which is the amount subtracted from F. But 
since this is also the amount by which its label increases, the lemma follows. □ 

We now show that Algorithm [T] returns a Hamiltonian s-t path of cost at most (2 log 2 n + l)-L. 

Proof of Theorem II. 1L At the end of the main loop, all nodes of V are part of either W or H 
or both. So when all components of H are incorporated into the path P, all nodes of V become 
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part of the path. We bound the cost of all the edges used in the final path by the total cost of all 
the path-cycle covers found on line [3] of the algorithm. We note that at the end of the algorithm, 
the cost of the flow F + H is no more than this total. 

We claim that when the s-t path P is found on line \T7\ F contains flow on every edge between 
consecutive nodes of P. This is similar to an argument used in [25]. First, since F is acyclic, it has 
a topological ordering. Suppose we find a flow decomposition of F into paths. There are at most 
21ogn + 1 such paths, and, by Lemma 12.51 each vertex of W participates in at least logn + 1, or 
more than half, of them. This means that any two vertices u, v S W must share a path, say P', 
in this decomposition. In particular, suppose that v immediately follows u in the path P. This 
means that v appears later than u in the topological order, so on P' v comes after u. Moreover, 
we claim that on P', v will be the immediate successor of u. If not, suppose that there is a node w 
that appears between u and v in P'. But this means that in the topological ordering (and thus in 
P), w will appear after u and before v, which contradicts the fact that they are consecutive in P. 
So we conclude that there is an edge with flow in F between any two consecutive nodes of P, and 
thus the path P costs no more than the flow F. 

Regarding H, we note that it is a sum of cycles, and thus Eulerian. So it is possible to find a 
Euler tour of each of its components, using only edges with flow in H. The subsequent shortcutting 
can only decrease the cost. Thus, the total cost of cycles found on line [19] is no more than the cost 
of the flow H. To describe how these cycles are incorporated into the path P, we show that each 
of them (or, equivalently, each connected component of H) shares exactly one node with W (and 
thus with P). Note that every component A added to H contains only nodes that are in W at that 
time. Moreover, when this is done, all but one nodes of A are expelled from W. So when several 
components of H are connected by the addition of A, the invariant is maintained that there is one 
node per component that is shared with W. Now, suppose that v is the vertex shared by the cycle 
obtained from component X and the path P. On line [20] we incorporate the cycle into the path 
by following the path up to v, then following the cycle up to the predecessor of v, then connecting 
it to the successor of v on the path. By triangle inequality, the resulting longer path costs no more 
than the sum of costs of the old path and the cycle. □ 

3 Integrality gap for relaxed ATSPP LP 

Consider LP (a) with | < a < 1, and say that it has cost L. We bound its integrality gap for 
ATSPP. As in the proof of Lemma 12. 2\ we can apply splitting-off to obtain a feasible solution to 
LP(a), of cost at most L, on a graph induced by a subset of vertices W C V. Let x be such a 
solution. Lemma 13.11 below shows how to use x to find a feasible fractional solution to LP ([6]) on 
W, of cost within a constant factor of L, namely 2q 3 _ 1 L. Since LP ([6]) has integer optimum, there 
is an integer solution to LP ([6]) on W, and thus a path-cycle cover, of cost at most 2a 3 1 L. Then 
we can proceed as in Section [2] applying Algorithm [1] to bound the cost of the resulting ATSPP 
solution by 2 logn + 1 times the path-cycle cover cost. This shows that LP(a) has integrality gap 
at most 61 °g"+ 3 , proving Theorem O 

Lemma 3.1 Given a solution x to LP (a), with a > 1/2, on a subset W C V, with cost at most 
L, a feasible solution to LP (Ej) on W of cost at most „ 3 _± L can be found. 

Proof. Multiply x by 1/a. Now it constitutes a flow F of 1/a units from s to t. Constraints ([5]), 
restricted to sets of size 1, imply that each node u now has at least one unit of flow going through 



S 



it. Find a flow decomposition of F into paths and cycles, so that the union of the paths is acyclic. 
Let F = F p + F c , where F p is the sum of flows on the paths in our decomposition, and F c is the 
sum of flows on the cycles. 

Choose some 7 such that ^- < 7 < 1. For any node u such that the amount of F p flow going 
through u is less than 7, shortcut any flow decomposition paths that contain u, so that there is no 
more F p flow going through u. Let U C W be the set of vertices still participating in the F p flow. 
Then each vertex in U has at least 7 units of F p flow going through it, and each vertex in W \ U 
has at least 1 — 7 units of F c flow going through it. 

We find a topological ordering of vertices in U according to F p (which is acyclic), and let P 
be an s-t path that visits the nodes of U in this topological order. We claim that the cost of P is 
within a constant factor of the cost of F p . The argument for this is similar to one in the proof of 
Theorem 11.11 Out of 1/a units of flow going from s to t in F p , each vertex u £ U carries 7 units, 
which is more than half of the total amount (as 7 > l/2a). So for any two such vertices u and v, 
there must be shared flow paths that carry flow of at least 27 — 1/a units. In particular, for every 
two consecutive nodes u,v 6 P, F p must contain such shared paths in which v immediately follows 
u. So the cost of P is at most 2 -y-i/ a times the cost of F p . 

We now define x as a flow equal to one unit of s-t flow on the path P plus times the flow 
F c . We claim that x is a feasible solution to LP ([6]): there is exactly one unit of flow from s to t 
(as F c consists of cycles not containing s or t); there is flow conservation at all nodes except s and 
t; each vertex in U (and thus in P) has at least one unit of flow going through it; and each vertex 
in W \ U has at least one unit of flow going through it (as it had at least 1 — 7 units of F c flow) . 
The cost of this solution is at most 

cost(F p ) + • cost(F c ) < max ( — — , J • — L. 



27 — 1/a 1 — 7 V27— 1/a'l — j J a 

= | + 3^, which satisfies ^ < 7 < 1, we see that the cost of x is at most ^ 



4 Relaxed ATSPP LP with a < 1/2 

Consider LP CO) with a = \ < \ for some integer k > 2. It can be shown that, as a relaxation for 
the ATSPP problem, this LP has unbounded integrality gap. For example, let D be an arbitrarily 
large value and consider the shortest path metric obtained from the graph in Figure [TJ One can 
verify that the following assignment of x-values to the arcs is feasible for LP ([T]) with a = 1 /2. 
Assign a value of 1 /2 to arcs (1,2), (3,2), (3,6), (1,4), (5,4), and (5,6) and a value of 1 to arcs 
(2, 3) and (4, 5). Every other arc is assigned a value of 0. This assignment is feasible for the linear 
program and has objective function value 5. On the other hand, any Hamiltonian path from 1 to 
6 has cost at least D. 

Let L be the cost of the optimal solution to LP (a = r). We show how to find k-log n paths from 
s to t, such that each node of G appears on at least one path, and the total cost of all these paths 
is at most felogn • L. Let us define a fc-path-cycle cover to be a set of k disjoint paths from s to t 
and zero or more cycles, which together cover all nodes. Like a path-cycle cover, the minimum-cost 
/c-path-cycle cover can be found by a combinatorial algorithm by creating k copies of both s and t 
and using the matching algorithm described in |20j . 

Lemma 4.1 For any subset W C V that includes s and t, there is a k-path-cycle cover ofW with 
total cost at most kL. 
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Figure 1: Bad gap example for LP (pQ) with a = l /2. Here, D is an arbitrarily large integer. 



Proof. As in the proof of Lemma 12.21 we a PPly splitting-off to a solution of LP (a) to get a 
solution to LP(a) on a subset W C V of vertices, of no greater cost. Now, if we multiply this 
solution by k, we get a feasible solution to LP ([6]) with constraints (JT]) replaced with 

x(5 + (s)) = x{6~{t)) = k. 

The cost of this solution is no more than kL. But this LP has an integer optimum, which, possibly 
after shortcutting, is exactly a /c-path-cycle cover. □ 

Proof of Theorem 11.31 We start with W = V and repeat the following until W = {s, t}: 

1. Find a /c-path-cycle cover F of W of cost at most kL, as guaranteed by Lemma [4.11 

2. Remove from W all nodes (except s and t) that participate in paths of F. 

3. For each cycle C of F, choose a representative node v G C, and remove from W all nodes of 
C except v. 

Let us say that the procedure terminates after T iterations. As the size of W halves in each 
iteration, T is at most log n. In the last iteration, all elements of W must have participated in the 
paths, as otherwise there would be a node v that remains in W after this iteration. This implies 
that the graph [JF is connected. It also has total cost at most kLlogn, and, if we add kT edges 
from t to s, becomes Eulerian. This means that we can construct kT < klogn paths from s to t, 
covering all nodes of V, out of edges of (J F. □ 

5 Algorithm for /c-person ATSPP 

In this section we consider the /c-person asymmetric traveling salesman path problem. The LP 
relaxation for this problem is similar to LP ([1]), but with x{5 + {s)) = x(5~(t)) = k for constraint ([3]) 
and x(6~(S)) > 1 for constraint ([5]). Arguments similar to those in Section [2] show that a fe-path- 
cycle cover on any subset W is a lower bound on the value of the LP relaxation for the /c-person 
ATSPP. Our algorithm constructs a solution that uses each edge of (9(A;logn) fc-path-cycle covers 
at most k times, proving a bound of 0(k 2 logn) on the approximation ratio and the integrality gap. 

Our algorithm starts by running lines [TlfT6l of Algorithm [H except with T = (k + l)logn + 1 
iterations of the loop and finding minimum-cost fe-path-cycle covers instead of the path-cycle covers 
on line [3l Then it finds k s-t paths in the resulting acyclic graph F, satisfying conditions of 
Lemma 15. II below. The algorithm concludes by incorporating each component of the circulation H 
into one of the obtained paths, similarly to lines [T81I2T1 of Algorithm [TJ 
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Lemma 5.1 After lines [WTb] of Algorithm^ are executed with T iterations of the loop on line® 
and finding minimum- cost k-path- cycle covers on line EJ there exist k s-t paths in the resulting 
acyclic graph F, such that each edge of F is used at most k times, and every node of F is contained 
in at least one path. Moreover, these paths can be found in polynomial time. 

Proof. We prove the existence result first. We note that the graph F can support kT units 
of flow from s to t. This is because in each of the T iterations, k s-t paths were added to the 
graph, whereas the removal of cycles does not decrease the amount of flow supported. So F can 
be decomposed into kT edge-disjoint paths from s to t. Moreover, each node of F participates in 
at least T — log n of these paths by Lemma 12.51 

Let K be the comparability graph obtained from F. Namely, K has the same set of nodes 
as F, and there is an undirected edge between nodes u and v in K if F contains a directed path 
either from it to v or from u to it. We claim that the nodes of K can be partitioned into at most k 
cliques. As comparability graphs are perfect graphs |14| . the minimum number of cliques that K 
can be partitioned into is equal to the size of the maximum independent set in K. So suppose, for 
the sake of contradiction, that K contains an independent set I of size k + 1. Then no nodes in I 
must share any of the paths in our decomposition of F. Since each appears on at least T — log n 
paths, there must be at least (k + 1) • (T — logn) paths total in the decomposition. However, this 
is a contradiction, since (k + 1) • (T — log n) = (k + l)k log n + k + 1 > (k + l)k log n + k = kT. 

Given the set of k cliques in K, we can convert each of them into an s-t path in F. For each 
such clique C, order the nodes of C in a way consistent with a topological ordering of F. Then F 
contains a path from each node u of C to the next node v. the existence of a uv edge in K shows 
that there is a path between these nodes in F, and the topological ordering guarantees that this 
path goes in the correct direction. There must also be paths from s to the first node of C as well 
as from the last node of C to t, since s is the only source and t is the only sink of F. Furthermore, 
the acyclicity of F shows that each edge of F is used at most once for connecting nodes in C. As 
there are k cliques, each edge is used at most k times in total. If the number of cliques is smaller 
than k, then we can add arbitrary paths from F to our collection to obtain exactly k paths. 

To find the required paths algorithmically, we construct the following bipartite graph B from 
F with bipartitions X and Y. For each v £ F \ {s,t}, add nodes x v to X and y v to Y (note that 
\X\ = \Y\). Now, for each ordered pair of nodes (u,v) such that there is a directed path from it 
to v in F, add an edge from x u to y v . Let K' be the directed graph obtained from K — {s,t} by 
orienting each edge uv according to flow F. It is easy to see that any collection of q disjoint cliques 
covering K — {s, t} corresponds to a covering of K' by q vertex-disjoint paths and vice-versa. We 
claim that there is a covering of K' by q disjoint paths if and only if there is a matching in B that 
leaves only q nodes in X and q nodes in Y unmatched. 

Consider a covering of K' by q vertex-disjoint paths P. Form a matching M — {xuUv '■ 
uv used by P}. Since P is a collection of vertex-disjoint paths, each node has indegree and outde- 
gree at most one in P, so M is a valid matching. Furthermore, since P covers the nodes of K' with 
q paths, then exactly q nodes have indegree and exactly q nodes have outdegree 0. Thus, in M 
exactly q nodes of X and q nodes of Y are unmatched. Conversely, let M be a matching of B that 
leaves exactly q nodes of X unmatched. For each unmatched node y u S Y, form a directed path 
in K' starting at it by the following process. If x u is matched to, say, y v , then add arc uv to the 
path. Continue this process from y v until the corresponding node in X is not matched. This will 
form q vertex-disjoint paths in K' that cover all nodes. 

As noted earlier, we know that K can be covered by at most k disjoint cliques. Therefore, there 
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is a matching in B that leaves at most k nodes in X and k nodes in Y unmatched. Compute such 
a matching and map it to the corresponding paths in K' . Finally, extend these at most k paths to 
s — t paths by adding an arc from s to the start of each path and an arc from the end of each path 
to t. Again, since the only source in F is s and the only sink in F is t, these arcs correspond to 
edges in K. □ 

Proof of Theorem 11.41 Let L be the cost of a linear programming relaxation for the problem. 
The edges of F as well as the edges used to connect the eulerian components of H to the paths come 
from the union of T /c-path-cycle covers on subsets of V, and thus cost at most T L = 0(k log re) -L. 
However, the algorithm may use each edge of F up to k times in the paths of Lemma 15. H which 
makes the total cost of the produced solution at most 0(k 2 logn) • L. □ 



6 Approximation algorithm for Directed Latency 

We introduce LP relaxation (jSJ) for the directed latency problem. The use of x uw and f£ w variables 
in an integer programming formulation for directed latency was proposed by Mendez-Diaz et al. 
[22], who do a computational evaluation of the strength of its LP relaxation. However, our LP 
formulation uses a different set of constraints than the one in |22j . 

In LP ([8]), a variable x uw indicates that node u appears before node w on the path. Similarly, 
x uvw for three distinct nodes u, v, w indicates that they appear in this order on the path. For every 
node v ^ s, we send one unit of flow from s to v, and we call it the v-fiaw. Then f" w is the amount 
of f-flow going through edge (u,w), and £(v) is the latency of node v. To show that this LP is a 
relaxation of the directed latency problem, given a solution path P, we can set f£ w = 1 whenever 
the edge (u, w) is in P and v occurs later than u in P, and f£ w = otherwise. So f v is one unit 
of flow from s to v along the path P. Also setting the ordering variables x uw and x uvw to or 1 
appropriately and setting £(v) to the latency of v in P, we get a feasible solution to LP ([8]) of the 
same cost as the total latency of P. 

Constraints (|12p are the flow conservation constraints for u-flow at node u. Constraints (|13p 
ensure that no v-Qow enters s or leaves v. Constraints (|14p say that u-flow passes through u if and 
only if u occurs before v. Since the t-flow goes through every vertex, when all the variables are in 
{0, 1}, it defines an s-t path. We can think of the t-How as the universal flow and Constraints (|15p 
ensure that every u-flow follows an edge which has a universal flow on it. Constraints (I16p . in an 
integer solution, ensure that if a set S contains some node y that comes before v (i.e. x yv = 1), 
then at least one unit of w-flow enters S. 

We note that a min-cut subroutine can be used to detect violated constraints of type (|16p , allow- 
ing us to solve LP (jHJ) using the ellipsoid method. Our analysis does not actually use Constraints 
(|15p . so we can drop them from the LP. Although without these constraints the corresponding 
integer program may not be an exact formulation for the directed latency problem, we can still find 
a solution whose cost is within factor O (log re) of this relaxed LP. 

Lemma 6.1 Given a feasible solution to LP with objective value L, we can find another solution 
of value at most (1 + in which the ratio of the largest to smallest latency £( ) is at most n 2 . 

Proof. Let (x,£, f) be a feasible solution with value L, with £{t) the largest latency value in this 
solution. Note that L > £(t). Define a new feasible solution (x,£', f) by £'(v) = m&x{£(v), £(t)/n 2 }. 
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s.t. £(v)> J2 d ^fL 

uw 

£(v) > [d su + d uw + d wv ] x uwv Vu,w,v : \{u,w,v}\ = 3 (9) 

£(t) > £(v) Vw 

wj, u : to, «}| = 3 (10) 

2W, + £«™ = 1 \Ju,w:u^w (11) 

z S u = Xut = l Vu ^ {s,i} 

EX = XX V«,Vu (12) 

Vf =Vp =1 vw 

/ j J sw / j J wv 

w w 

fZs = fvu = V Vu, V (13) 

E ^ = Xuv Vv,u^v (14) 

fL<fL v«,tu,« (15) 

E /™>*i/» V5cy\W,yeS (16) 



The total increase in the objective function is at most n ■ -Xf < L/n as there are n nodes in total. 
Thus, the objective value of this new solution is at most (1 + l/n)L. □ 

Using Lemma 16.11 and scaling the edge lengths (if needed) , we can assume that we have a 
solution (x, £, /) satisfying the following: 

Corollary 6.2 There is a feasible solution (x,£, f) in which the smallest latency is 1 and the largest 
latency is at most n 2 and whose cost is at most (1 + — ) times the optimum LP solution. 

Let L* be the value (i.e. total latency) of this solution. 

The idea of our algorithm is to construct s-v paths for several nodes v, such that together 
they cover all vertices of V, and then to "stitch" these paths together to obtain one Hamiltonian 
path. We use our results for ATSPP to construct these paths. For this, we observe that parts of a 
solution to the latency LP can be transformed to obtain feasible solutions to different instances 
of LP(a). For example, we can construct a Hamiltonian s-t path of total length O(logn) • £{t) as 
follows. From a solution to LP ([8]), take the i-flow defined by the variables and notice that it 
constitutes a feasible solution to LP(a = 1). In particular, since x y t = 1 for all y, constraints (|16p 
of LP ([8]) for v = t imply that the set constraints ([5]) of LP (pQ) are satisfied. The objective function 
value for LP ([T|) of this solution is at most £(t). Thus, by Theorem 11.11 we can find the desired 
path. Of course, this path is not yet a good solution for the latency problem, as even nodes v with 
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£(v) <C £(t) can have latency in this path close to O(logn) • £(t). Our algorithm constructs several 
paths of different lengths, incorporating most nodes v into paths of length O(logre) -£(v), and then 
combines these paths to obtain the final solution. 

Algorithm 2 Directed Latency 
1: Let (x,£, f) be a solution to LP (|8|). Let S be the path {s}. 

2: Partition the nodes into g = [log l(t) + lj sets V\, . . . , V g with v G Vi if 2 l ~ l < £(v) < 2\ 
3: for i = 1 to g — 1 do 
4: for j = 1 to 2 do 
5: if ^ then 

6: Let vf = argmax^gy. \{u S Vi : x uv > |}| > this maximizes the size of Bf below 

7: Let M = {u G V : x j>\ + ^ ±i } 

i L uv, — 3 24 log n > 

8: Let Bf = {ueVi-. x uvJ > i} > \B{\ > (\Vi\ - l)/2 

9: Find an s-vj path P^, containing of cost 5\ logn • 2 l ; append P- to S. 

10: Find 21ogn s-vj paths V\, containing Bf, of total cost at most 21ogn • 2'; append 

Vf to S. 

ll: Vi = Vi\(Ai\JBf\J{vf}) > size of Vi is at least halved 

12: end if 

13: end for 

14: Let Vi+i = Vi+i U Vj > remaining nodes are carried over to the next set 

15: end for 

16: Construct an s-t path P g , containing V g , of cost at most (21ogn + 1) • £(t). Append P g to S. 
17: Shortcut S over the later copies of repeated nodes. Output S. 



6.1 Constructing the paths 

Algorithm [2] finds an approximate solution to the directed latency problem, and we now explain 
how some of its steps are performed. The algorithm maintains a path S, initially containing only 
the source, and gradually adds new parts to it. This is done through operation append on lines [9j 
[TOl and [T6j To append a path P to S means to extend S by connecting its last node to the first 
node of P that does not already appear in S, and then following until the end of P. For example, 
if S = sabc and P = sbdce, the result is 5 = sabcdce. Step [10] appends a set of paths to S. This 
just means sequentially appending all paths in the set, in arbitrary order, to S. 

Next we describe how to build paths P- and V\ in Steps [9] and [10J We described above how 
to use Theorem 11.11 to build a Hamiltonian s-t path P of length (21og?i + 1) • £(t), which is used 
on line [16] of the algorithm. The idea behind building paths P- and Vf with their corresponding 
length guarantees is similar. 

To construct P- , we do the following. Since each node u G A\ has x j > 2/3, the amount of 

vj-Row that goes through u is at least 2/3. We apply splitting-off on this flow to nodes outside of 
A?, and obtain a total of one unit of s-vf flow over the nodes in A?, of cost no larger than £{vf) < 2*. 
This flow satisfies all the constraints of LP (a = 2/3), including the set constraints (|5]), which are 
implied by the set constraints (|16p of the latency LP Q, as x j > 2/3 for u G A\. Thus, using 

Theorem 11.21 we can find a path from s to vj, spanning all the nodes of A?, whose cost is at most 
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Si logra • 2 l for some constant <5i. 

To obtain the set of paths V\, we look at the vj-B.ow going through each node of B\, whose 
amount is at least \ . After splitting-off all nodes outside of B\ , we get a feasible solution of cost 
at most i^vj) < 2 % to LP (a = 1/2). By Theorem 11.31 we can find 21ogn paths, each going from s 
to v\ , which together cover all the nodes of B\ , and whose total cost is at most 2 log n ■ 2* . 

6.2 Connecting the paths 

We now bound the lengths of edges introduced by the append operation in the different cases. 
For a path P, let app(P) be the length of the edge used for appending P to the path S in the 
algorithm. 

Lemma 6.3 For any i, j, and path P G app(P) < 6 • 2\ Also, app{P g ) < 6 • 2 9 . 

Proof. Let u be the last node of the path S before the append operation, v\ be the last node of 
P, and w be the first node of P that does not appear in 5. We need to bound d uw , the distance 
from u to w. 

We observe that x wu < 5/6. If u = s, this is trivial. Otherwise, u = v\, is the endpoint of some 
path constructed in an earlier iteration. Note that j' < 2 and i' < g — 1 < \og£(t) < 21ogn by our 
assumption that £(t) < n 2 , which means that | > | + 2 2A\ogn • ^ we Xwu ^ ^/^' then w 

would be included in the set At and in the path Pi , and thus be already contained in S, which is 
a contradiction. 

Consequently, x uw = 1 — x wu > 1/6. This means that the amount of u>-flow that goes through 
u is at least 1/6. Since this flow has to reach w after visiting u, it has to cover a distance of at least 
d uw , thus adding at least | • d uw to £(w), the latency of w. Thus, £(w) > \d uw , and d uw < 6£(w). 
Now, if w G V\, it must be in B\, which, by definition, means that w £ Vi, and therefore £{w) < 2\ 
So app(P) = d uw <6- 2\ If w € P g , then app{P g ) < 6£{w) < 6£{t) < 6 • 2P . □ 

To bound the cost of appending a path P- to S, we need an auxiliary lemma. 
Lemma 6.4 For any e > 0, if x uw + x wv > 1 + e, £/ien > e • d uw . 
Proof. Using Constraint (|10p we have: 

1 6 ^ %uw ~t~ ^uju 

— 2x uwv + (^Cuuu; "I - X uvw ) + {x wuv -\- X wvu ). 

On the other hand, -t- x uvw ^j -\- {x wuv -\- x wvu ) ^ 3^u? ~t~ ^wu — 2 (^Cuw ^wv) — 1 ^? 

using again Constraint (I10|) . then Constraint (jlip . and the assumption of the lemma. Therefore, 
2x uu ,t, > (1 + e) — (1 — e) = 2e, i.e. x uwv > e. Then the claim follows using Constraint Q. □ 

Lemma 6.5 For any i and j, app(P-) < 241ogn • 2*. 

Proof. Let u, v\, and w be as in the proof of Lemma 16.31 To bound d uw , we consider two cases. 
Case 1: If w G Vi, we apply the same proof as for Lemma f6.3l and conclude that app(P-) < 6-2*. 
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Case 2: If w ^ Vi, let (i',f) be an earlier iteration of the algorithm in which node u = vL was 
added to S. Since w ^ 5, it must be that u> ^ A?, , and thus i;„ < | + ^Tog^n • On ^ ne °tber 
hand, since w € A\, it must be that — | + 24 logn • Because 2i' + j' < 2i + j — 1, we have 

i i 

2i' - 2 + ?'' 2i - 2 + j 

> 1 lJ— -| 

24 log n 24 log n 

i + 1 



24 log n 

Using Lemma EU we get that app(P-) = d uw < 24 log n ■ < 24 log n • 2 l . □ 

Lemma 6.6 Suppose that a node v is first added to path S in iteration k of the outer loop of the 
algorithm. Then the latency of v in S is at most 62 log n ■ 2 k , for some constant 62 > 0. 

Proof. Let len(P) denote the length of a path P. The latency of node v on S is at most: 



k 2 

EE 

i=i 3=1 



len(P?) + ^2 len ( p ) + app{Pi) + ^2 a PP( p ) 
k 2 

< ^2Y^ ^i 1 ogra-2 i + 21ogn-2 i + 241ogn-2 i + 21ogn-6-2 i ] 

i=l 3=1 

< 5 2 log n ■ 2 k 

□ 

Suppose that n« is the number of nodes that are originally placed into the set Vi. Since a node 
v is originally placed in Vi if £(v) > 2*" 1 , the value of the LP solution L* can be bounded by: 

L* = ^£(v) > ^n.2- 1 . (17) 

V i=l 

Let n- denote the size of Vi at the beginning of iteration i of the outer loop. Note that n- may 
be larger than rjj since some nodes may have been moved to Vi in Step [TH of the previous iteration. 

Claim 6.7 For any i, the size of the set Vi at the end of iteration i is at most n^/A. 

Proof. Consider the iteration = 1). Note that the vertex v\ is chosen precisely to maximize 
the number of nodes u in V with x j > 1/2, which is the size of the set B] . If we imagine a 

' uv. — ' ' % 

directed graph H on the set of vertices Vi, in which an edge w) exists whenever x uw > 1/2, then 
v\ is the vertex with highest in-degree in this graph. Now, from Constraint (jlip . it's not hard to 
see that some vertex in H will have in-degree at least {p! — l)/2. So the number of nodes removed 
from Vi in step [11] of the algorithm is at least \Bl U {vj}\ > n'/2, and size of Vi decreases at least 
by a factor of two. Similarly, at least half of the remaining nodes of Vi are removed in the iteration 
j = 2, so overall the size of Vi decreases at least by a factor of four. □ 
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We now show that the total latency of the final solution S is at most O(logn) • L*. 

Proof of Theorem 11.51 From Claim loTTl it follows that at most a 1/4 fraction of the n! i nodes 
that are in V{ at the beginning of iteration i are moved to the set Vi+i at the end of this iteration. 
Thus, for any 1 < i < g, n' i <rii + n'^/4. This implies that r n! i < Ylh=i n h/4 l ~ h . 

Now we claim that the total latency of the solution S is at most Y2f=i n i ' ^2 logn • 2\ This is 
because at most n- nodes are added to S in iteration i, and each such node has latency at most 
82 logn • 2 l (using Lemma |6.6|) . Therefore, the total latency of the solution is at most: 

a 9 i 



^n' i -5 2 \ogn-2 i < ^<5 2 logra-2 



QjL—h 

1 h=l 



5 2 logn^^2 ft --2 fc n h 
i=i h=i 
9 



< 5 2 logn^2 /l n h ^- 

h=l 

< 0(logn)-L*, 



2 l 

h=l i=0 



using the bound on re-ordering the summation, and using inequality (|17p . Combined with 
Corollary 16.21 this proves the theorem. □ 



6.3 Extensions 

The above algorithm can be easily extended to the more general setting in which every node of 
the graph comes with a weight c(v) and the goal is to find a Hamiltonian s-t path to minimize the 
total weighted latency, where the weighted latency of a node Vi is equal to c(v) ■ S}=i d VjVj+1 . This 
requires changing the objective function of LP ([8]) to ^2 V ^ S c(v)£(v) and changing the definition of 

uj on line [6] of Algorithm [2] to maximize the total weight, instead of the number, of vertices in Bf. 

We also note that our approximation guarantee for directed latency is of the form 0(7 + 
logn), where 7 is the integrality gap of ATSPP. So an improvement of the bound on 7 would not 
immediately lead to an improvement for directed latency. 
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